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1 Introduction 

Let DifF (M) be the space of C r diffeomorphisms of a finite-dimensional smooth compact manifold 
M with the uniform C r -topology, where dim M > 2, and let / E DifF(M). Consider the number 
of periodic points of period n 

Pn(f) = #{x G M : x = f n (x)}. (1) 

The main question of this paper is: 

Question 1. How quickly can P n (f) grow with n for a "generic" C r diffeomorphism f? 

We put the word "generic" in brackets because as the reader will see the answer depends on 
notion of gencricity. 

For technical reasons one sometimes counts only isolated points of period n; let 

Pn(f) = #{•"£ £ M : x = f n (x) and y ^ f n {y) for y ^ x in some neighborhood of x}. (2) 

We call a diffeomorphism / £ DifF (M) an Artin-Mazur diffeomorphism (or simply A-M diffeo- 
morphism) if the number of isolated periodic orbits of / grows at most exponentially fast, i.e. 
for some number C > 0, 

P l n {f) < exp(CV) for all n E Z+. (3) 



Artin & Mazur [AM proved the following result. 



Theorem 1.1. For < r < oo, A-M diffeomorphisms are dense in Dijf(M) with the uniform 
C r -topology. 

We say that a point x G M of period n for / is hyperbolic if df n (x) , the derivative of /" at 
x, has no eigenvalues with modulus 1. (Notice that a hyperbolic solution to f n (x) = x must also 
be isolated.) We call / G DifF(M) a strongly Artin-Mazur diffeomorphism if for some number 
C > 0, 

P n {f) < exp(CV) for all n E Z+, (4) 



and all periodic points of / are hyperbolic (whence P n (f) — Pn(f))- I n |K1] an elementary proof 
of the following extension of the Artin-Mazur result is given. 
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Theorem 1.2. For < r < oo, strongly A-M diffeomorphisms are dense in Diff(M) with the 
uniform C r -topology. 

According to the standard terminology, a set in DifF(M) is called residual if it contains a 
countable intersection of open dense sets and a property is called (Baire) generic if diffeomor- 
phisms with that property form a residual set. It turns out the A-M property is not generic, as 
is shown in [K2]. Moreover: 



Theorem 1.3. For any 2 < r < oo there is an open set M C Diff (M) such that for any 

given sequence a — {a„}„ e z + there is a Baire generic set lZ a in M depending on the sequence a n 



with the property if f G lZ a , then for infinitely many nu G we have P^Af) > o> 



Of course since P n (f) > P^.(f)> the same statement can be made about P n (f). But in fact it is 
shown in [K2] that P n (f) is infinite for n sufficiently large, due to a continuum of periodic points, 
for at least a dense set of / 6 Af. 

The proof of this Theorem is based on a result of Gonch enko -Shil nikov-T uraev |GST1 1. Two 
slightly different detailed proof s of the ir result are given in K2] and [GST2]. The proof in [K2| 
relies on a strategy outlined in GST1 |. 

However, it seems unnatural that if you pick a diffeomorphism at random then it may have an 
arbitrarily fast growth of number of periodic points. Moreover, Baire generic sets in Euclidean 
spaces can have zero Lebesgue measure. Phenomena that are Baire generic, but have a small 
probability are well-known in dynamical systems, KAM theory, number theory, etc. (seefOj, 
[HSY], [K3 for various examples). This partially motivates the problem posed by Arnold flM: 



Problem 1. Prove that "with probability one" f G DifF(M) is an A-M diffeomorphism. 

Arnold suggested the following interpretation of "with probability one" : for a ( Baire ) generic 
finite parameter family of diffeomorphisms {f e }> for Lebesgue almost every e we have that f E is A- 
M (cf. [K3 ). As Theorem 1.3 shows, a result on the genericity of the set of A-M diffeomorphisms 
based on (Baire) topology is likely to be extremely subtle, if possible at allQ. We use instead 
a notion of "probability one" based on prevalence [HSY, |K3|], which is independent of Baire 



genericity. We also are able to state the result in the form Arnold suggested for generic families 
using this measure-theoretic notion of genericity. 

For a rough understanding of prevalence, consider a Borcl measure /iona Banach space V. 
We say that a property holds "/i-almost surely for perturbations" if it holds on a Borel set P C V 
such that for all v G V we have v + w G P for almost every w with respect to Notice that 
if V = M. k and /i is Lebesgue measure, then "/i-almost surely for perturbations" is equivalent to 
"Lebesgue almost everywhere". Moreover, the Fubini/Tonelli Theorem implies that if /i is any 
Borel probability measure on M. k , then a property that holds /i-almost surely for perturbations 
must also be hold Lebesgue almost everywhere. Based on this observation, we call a property on a 
Banach space "prevalent" if it holds /i-almost surely for perturbations for some Borel probability 
measure fi on V, which for technical reasons (cf. |HSY|) we require to have compact support. 
In order to apply this notion to the Banach manifold DifF(A/), we must describe how we make 
perturbations in this space, which we will do in the next Section. 



1 For example, using techniques from [GST2| and [K2 one can prove that for a Baire generic finite-parameter 
family {/ e } and a Baire generic parameter value s the corresponding diffeomorphism / e is not A-M. Unfortunately, 
how to estimate from below the measure of non-A-M diffeomorphisms in a Baire generic finite-parameter family 
is so far an unreachable question. 



2 A similar notion of prevalence is used in [ V^K 
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Our first main result is a partial solution to Arnold's problem. It says that for a prevalent 
diffeomorphism f G DifF(M), with 1 < r < oo, and all 5 > there exists C = C(S) > such 
that for all n G Z +; 



P n (f) < eMCn 1+s ). (5) 



The Kupka-Smale theorem (see e.g. ]PMj ) states that for a generic diffeomorphism all peri- 
odic points are hyp erbo lic and all associated stable and unstable manifolds intersect one another 
transversally. Ref. [|K3| shows that the Kupka-Smale theorem also holds on a prevalent set. So, 
the Kupka-Smale theorem, in particular, says that a Baire generic (resp. prevalent) diffeomor- 
phism has only hyperbolic periodic points, but how hyperbolic are the periodic points, as function 
of their period, for a Baire generic (resp. prevalent) diffeomorphism f? This is the second main 
problem we deal with in this paper. 

Recall that a linear operator L : R N — > R w is hyperbolic if it has no eigenvalues on the unit 
circle {\z\ = 1} C C. Denote by | ■ | the Euclidean norm in C . Then we define the hyperbolicity 
of a linear operator L by 

j(L) = inf inf \Lv — exp(27ri0)v|. (6) 

0<E[O,1) \v\=l 

We also say that L is 7-hyperbolic if ^f(L) > 7. In particular, if L is 7-hyperbolic, then its 
eigenvalues {XjjjLx C C are at least 7-distant from the unit circle, i.e. mhij ||Aj| — 1| > 7. 
The hyperbolicity of a periodic point x = f n {x) of period n, denoted by 7„(a:,/), equals the 
hyperbolicity of the derivative df n (x) of /" at points x, i.e. 7,1(2;,/) = j(df n (x)). Similarly to 
the number of periodic points P n {f) of period n, define 

7n(/) = r mm j n (x,f). (7) 

{x: x=f n (x)\ 

The idea of Gromov g and Yomdin Jy) of measuring hyperbolicity is that a 7-hypcrbolic 
point of period n of a C 2 diffeomorphism / has an M2~ 2ri 7-neighborhood (where M2 = ||/||c 2 ) 
free from periodic points of the same period^. One can prove the following slightly more general 
result. 

Proposition 1.1. Let M be a compact manifold of dimension N, let f : M — > M be a C 1+p 
diffeomorphism (where < p < 1 ) that has only hyperbolic periodic points, and let M\ +p = 
max(||/|| c i+ P , 2 1 / p ). Then there is a constant C = C(M) > such that for each n G Z + we have 

Pn(f) < C (M 1+P ) nN{1+p)/p ln (f)- N/p . (8) 



Proposition 1.1 implies that a lower estimate on the decay of hyperbolicity 7«(/) gives an 
upper estimate on the growth of the number of periodic points P n (f). Therefore, a natural 
question is: 

Question 2. How quickly can "f n (f) decay with n for a "generic" C r diffeomorphism f? 

3 In [Y| hyperbolicity is introduced as the minimal distance of eigenvalues to the unit circle. This way of defining 
hvperb olicit v does not guarantee the existence of a M^~ 2n 7- neighbor hood free from periodic points of the same 



period [KH 
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The existence of lower bound on a rate of decay of 7 n (/) f° r Baire generic / 6 DifF (M) would 
imply the existence of an upper bound on a rate of growth of the number of periodic points P n (/) , 



whereas no such bound exists by Theorem 1.3. Thus again we consider genericity in the measure- 



theoretic sense of prevalence. Our second main result, which in view of Proposition 1.1 implies 
the first main result, is that for a prevalent diffeomorphism f G DifF(Af), with 1 < r < oo, and 
all S > there exists C = C{S) > such that 

7n(/) > exp(-Cn 1+s ). (9) 

Now we shall discuss in more detail our definition of prevalence ("probability one") in the 
space of diffeomorphisms DifF ( Af ) . 

2 Prevalence in the Space of Diffeomorphisms DifF(M) 

The space of C r diffeomorphisms DifF(Af) of a compact manifold M is a Banach manifold. 
Locally we can identify it with a Banach space, which gives it a local linear structure in the 
sense that we can perturb a diffeomorphism by "adding" small elements of the Banach space. 
As we described in the previous section, the notion of prevalence requires us to make additive 
perturbations with respect to a probability measure that is independent of the place that we 
make the perturbation. Thus although there is not a unique way to put a linear structure on 
DifF (M), it is important to make a choice that is consistent throughout the Banach manifold. 

The way we make perturbations on DifF (M) by small elements of a Banach space is as 
follows. First we embed M into the interior of the closed unit ball B N c R , which we can 



do for N sufficiently large by the Whitney Embedding Theorem [W|. We emphasize that our 
results hold for every possible choice of an embedding of M into R-^We then consider a closed 
tube neighborhood U C B of M and the Banach space C r (U, M. N ) of C r functions from U to 
R N . Next, we extend every element / € DifF (M) to an element F G C r (U,R N ) that is strongly 
contracting in the directions transverse to M. Again the particular choice of how we make this 
extension is not important to our resu lts; in the Appendix we describe the conditions we need to 
ensure that the results of Sacker Sac ] and Fenichel [FJ apply as follows. Since F has M as an 
invariant manifold, if we add to F a small perturbation in g G C r (U, M ), the perturbed map 
F + g has an invariant manifold in U that is close to M. Then F + g restricted to its invariant 
manifold corresponds in a natural way to an element of DifF (A/), which we consider to be the 
perturbation of / G DifF(M) by g G C r (U,R N ). The details of this construction are described 
in the Appendix. 

In this way we reduce the problem to the study of maps in DifF(t/), the open subset of 
C r (£7, R ) consisting of those elements that are diffeomorphisms from U to some subset of its 
interior. The construction we described in the previous paragraph ensures that the number of 
periodic points P n (f) and their hyperbolicity 7 n (/) for elements of DifF (M) are the same for the 
corresponding elements of DifF([/), so the bounds that we prove on these quantities for almost 
every perturbation of any element of DifF (U) hold as well for almost every perturbation of any 
element of DifF(M). Another justification for considering diffeomorphisms in Euclidean space is 
that the problem of exponential/superexponential growth of the number of periodic points P n (f) 
for a prevalent / G DifF (M) is a local problem on M and is not affected by a global shape of M. 

The results stated in the next section apply to any compact domain U C but for simplicity 
we state them for the closed unit ball B N . In the previous section, we said that a property is 
prevalent on a Banach space such as C r (B N , M. N ) if it holds on a Borel subset S for which there 
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exists a Borel probability measure fi on C r (B N , M. N ) with compact support such that for all 
F G C r (B N , M. N ) we have F + g G S for almost every g with respect to /i. The complement of 
a prevalent set is said to be shy. We then say that a property is prevalent on an open subset of 
C r (B N ,t N ) such as DiS r (B N ) if the exceptions to the property in DifF(_B JV ) form a shy subset 
oiC r (B N ,R N ). 

In this paper the perturbation measure \x that we use is supported within the analytic func- 
tions in C r (B N , R N ). In this sense we foliate DiS r (B N ) by analytic leaves that are compact and 
overlapping. The main result then says that for every analytic leaf L C DifF (B N ) and every 
S > 0, for almost every diffeomorphism f G L in the leaf L both (^j and are satisfied. Now 
we define an analytic leaf as a "Hilbert brick" in the space of analytic functions, and a natural 
Lebesgue product probability measure n on it. 



3 Formulation of Main Results 

Fix a coordinate system x — (x\, . . . , xjv) G M. n D B n and the scalar product (x, y) — Xjj/j. 
Let a = (ai,...,ajy) be a multiindex from and let \a\ = For a P om t x = 

(xi, . . . ,xn) G M. n we write x a = YiiLi x t*- Associate to a real analytic function 4> '■ B N — > l w 
the set of coefficients of its expansion: 

<t>e{x) = ^X a . (10) 

Denote by W k ,N the space of TV-component homogeneous vector-polynomials of degree k in N 
variables and by v(k,N) = diruWj^iv the dimension of Wk,N- According to the notation of the 
expansion (|lo|), denote coordinates in Wk,N by 

£fc = ({e a }\ a \=k) e W k ,N- (11) 

In Wk,N we use a scalar product that is invariant with respect to orthogonal transformation of 
R N d'b n , defined as follows: 



c?Cfc)k= X/ (a) ^ Q '^' ll^felU — ((sfci £fe)fc) 



(£fc,Cfc)fe= ^ (I) (£a,Ca), ||£fc|U = {{£k,e k ) k ) 1/2 . (12) 
»|=fc 

Denote by 

SjfeV) = {£fc e Wfc,jv : ||£fc||fc<r} (13) 

the closed r-ball in Wk,N centered at the origin. Let Leb k ,N be Lebesgue measure on Wk,N 
induced by the scalar product (|l^ ) and normalized by a constant so that the volume of the unit 
ball is one: Leb k , N (Bj? (1)) = 1. 

Fix a nonincreasing sequence of positive numbers r = ({rk}kLo) such that r& — ► as k oo 
and define a Hilbert brick of size r 

HB N (r) ={e= {e a } aeZ N : for all k G Z+, ||4IU < r fc } 

(14) 

=B w (r ) x (n) x • • • x Bf (rjt) x • • • c W ,jv x W 1>JV x • • • x W fclJV x . . . . 
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Define a product Lebesgue probability measure p,^ associated to the Hilbert brick HB N (r) of size 
r by normalizing for each k G Z + the corresponding Lebesgue measure Lebk t N on Wk,N to the 
Lebesgue probability measure on the r fe -ball B% (r fe ): 

<r = ^ Kfc,JV) and ^ = xgl /i£ rfc . (15) 

Definition 3.1. Let f G DifF(S JV ) &e a C r diffeomorphism of B N into its interior. We call 
HB N (r) a Hilbert brick of an admissible size r = ({rfe}jE_ ) respect to f if: 

A) for each e G HB N (r), the corresponding function 4>g(x) — X^aez™ £a% a * s analytic on B N ; 

B) for each e G HB N (r), £/ie corresponding map fg{x) = fix) + $g(x) is a diffeomorphism 
from B N into its interior, i.e. {f £ }e<EHB N (r) 

C DifF (B N ); 

C) for all 8 > and aH C > 0, the sequence r^ exp(Cfc 1+(5 ) —* oo as k — > oo. 

Remark 3.1. XTie /irsi and second conditions ensure that the family {ft}e£HB N {r) ^ e * n arl 
analytic leaf within the class of diffeomorphisms Y)\W{B N ). The third condition provides us 
enough freedom to perturb. It is important for our method to have infinitely many parameters 
to perturb. If Tk 's decayed too fast to zero it would make our family of perturbations essentially 
finite- dimensional. 

An example of an admissible sequence r = ({rk}kLo) is ffc = r/k\, where r depends on / 
and is chosen sufficiently small to ensure that condition (B) holds. Notice that the diameter of 
HB N (r) is then proportional to r, so that r can be chosen as some multiple of the distance from 
/ to the boundary of DifF(S JV ). 

Main Theorem. For any < p < oo and any C 1+p diffeomorphism f 6 DiS 1+p (B N ) , consider 
a Hilbert brick HB N (r) of an admissible size r with respect to f and the family of analytic 
perturbations of f 

{fei x ) = f( x ) + <Pe(x)}^ eH Bf(r) (16) 

with the Lebesgue product probability measure fi^ associated to HB N (r). Then for every 5 > 
and for /j,^ -a.e. £ there is C — C(e, S) > such that for all n G Z+ 

7 „(/ r ) > exp(-Cn 1+s ) P n (fe) < exp(Cn 1+5 ). (17) 



Remark 3.2. The fact that the measure fj,^ depends on f does not conform to our definition 
of prevalence. However, we can decompose DiS r (B N ) into a nested countable union of sets Sj 
that are each a positive distance from the boundary of DifF (B N ) and for each j G Z + choose an 
admissible sequence Tj that is valid for all f G Sj. Since a countable intersection of prevalent 
subsets of a Banach space is prevalent [HSY], the Main Theorem implies the results stated in 



terms of prevalence in the introduction. 

In the Appendix we deduce from the Main Theorem the following result. 

Theorem 3.1. Let {f £ }eeB™ C Diff 1+P (M) be a generic m-parameter family of C 1+p diffeo- 
morphisms of a compact manifold M for some p > 0. Then for every 6 > and a.e. e G B m 
there is a constant C = C(e,S) such that fiP\) is satisfied for every n G Z+. 
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In the Appendix we also give a precise meaning to the term generic. 
Let us formulate the most general result we shall prove. 

Definition 3.2. Let 7 > and f £ DiS 1+p (B N ) be a C 1+p diffeomorphism for some p > 0. 
A point x £ B N is called (n,^) -periodic if \\f n {x) — x\\ < 7 and (ri,7) -hyperbolic if 7„(.t, f) = 
l(df n (x))> 1 . 

(Notice that a point can be (n, 7)-hyperbolic regardless of its periodicity, but this property is of 
interest primarily for (n, 7)-periodic points.) For positive C and 6 let 7„(C, S) = exp(— Cn 1+S ). 

Theorem 3.2. Given the hypotheses of the Main Theorem, for every 5 > and for p^-a.e. e 
there is C = C(e,S) > such that for all n £ Z + , every {n,^fl/ p (C, 5)) -periodic point x G B N 
is (ri,7 ra (C, 6)) -hyperbolic. (Here we assume < p < 1; in a space Diff 1+P (B N ) with p > 1, the 
statement holds with p replaced by 1.) 



This result together with Proposition 1.1 implies the Main Theorem, because every periodic point 



of period n is (n, 7)-periodic for all 7 > 0. 



Remark 3.3. In the statement of the Main Theorem and Theorem 3.i the unit ball B N can be 
replaced by a bounded open set U C R N . After scaling, U can be considered as a subset of the 
unit ball B N . 

One can define a distance on a compact manifold M and almost periodic points of diffeomor- 
phisms of M. Then one can cover M = UiUi by coordinate charts and define hyperbolicity for 
almost periodic points using these charts {Ui}i (see [M for details). This gives a precise meaning 
to the following result. 

Theorem 3.3. Let {f s } e eB m C Diff 1+p (Af) be a generic m-parameter family of diffeomorphisms 
of a compact manifold M for some p > 0. Then for every S > and almost every e £E B m there 
is a constant C ~ C{e,8) such that every (n,^fU p (C, 5)) -periodic point x in B N is (n, 7„(C, <5))- 
hyperbolic. (Here again we assume < p < 1, replacing p with 1 in the conclusion if p > 1.) 



The meaning of the term generic is the same as in Theorem 3.1 and is discussed in the 
Appendix. 



4 Formulation of the main result in the 1-dimensional case 

The proof of the main result about estimating the rate of growth of the number of periodic points 
for diffeomorphisms in N dimensions has a lot of complications related to multidimcnsionality. 
To describe a model which is from one side nontrivial and from another side is useful for under- 
standing the general technique we apply our method for the 1-dimcnsional maps. The statement 
of the main result for the 1-dimensional maps has another important feature: it explains the 
statement of the main multidimensional result. 

Fix the interval / = [—1,1]. Associate to a real analytic function </>:/—> K the set of 
coefficients of its expansion 

00 

k=0 
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For a nonincreasing sequence of positive numbers r = {{rk}kL ) such that — > as k — > oo 
following the multidimensional notations we define a Hilbert brick of size r 

HB 1 (r) = {e = {e k }%L : for all k G Z+, |e fc | < r fc } (19) 

and the product probability measure /zj associated to the Hilbert brick _ff_B 1 (r) of size r which 
considers each Ek to be an independent random variable uniformly distributed on [— r^, r^]. 
Main 1-dimensional Theorem. For any < p < oo and any C 1+p map f : I —> I of the 

interval I = [— 1, 1] consider a Hilbert brick HB 1 ^) of an admissible size r tuif/j respect to f and 
the family of analytic perturbations of f 

{fe(x) = f(x) + <t> s (x)htHBi(r) (20) 

with the Lebesgue product probability measure /ij associated to HB 1 ^). Then for every S > 
and /ij-a.e. e there is C — C(e, 5) > suc/i i/ia< for all n £ Z+ 

7„(/ e )> expt-Cn^ 5 ), P n (/ E ) < exp(Cn 1+5 ). (21) 



In [MM£] Martens-de Melo-Van Strien prove in a sense a stronger statement for C 2 maps. 
They show that for any C 2 map / of an interval without "flat" critical points there are some 
7 > and uq G Z + such that for any n > no we have 7 n (/) > 1 + 7- This also implies that the 
number of periodic points is bounded by an exponential function of the period. The notion of a 



fiat critical point used in [MMS| is a nonstandard one from a point of view of singularity theory. 
For C 2 maps, they call xq a flat critical point of / if f'(xo) = f"(xo) = 0; the distance from f{x) 
to f(xo) does not have to decay to as x — > xq faster than any power of x — xq. 

In |KK| an example of a C 2 -unimodal map with a critical point having tangency of order 4 and 
an arbitrarily fast rate of growth of the number of periodic points is given. Another advantage 
of the Main 1-dimcnsional Theorem is that it works for C 1+p maps with < p < 1, whereas the 



result in [MMS] works only for C 2 maps. 



5 Strategy of the proof 

Here we describe the strategy of the proof of Theorem \i.2\ The basic technique is developed 



and many of the technical difficulties are resolved by the first author in [K4|. The general idea 
is to fix C > and prove an upper bound on the measure of the set of "bad" parameter values 
e € HB N (r) for which the conclusion of the theorem does not hold. The upper bound we obtain 
will approach zero as C — > oo, from which it follows immediately that the set of e E HB N (r) 
that are "bad" for all C > has measure zero. For a given C > 0, we bound the measure of 
"bad" parameter values inductively as follows. 

Stage 1. We delete all parameter values e £ HB N (r) for which the corresponding diffeomor- 
phism fg has an almost fixed point that is not sufficiently hyperbolic, and bound the measure of 
the deleted set. 

Stage 2. After Stage 1, we consider only parameter values for which all almost fixed points 
are sufficiently hyperbolic. Then we delete all parameter values e for which fg has an almost 
periodic point of period 2 which is not sufficiently hyperbolic and bound the measure of that set. 

Stage n. We consider only parameter values for which all almost periodic point of period at 
most n — 1 are sufficiently hyperbolic (we shall call this the Inductive Hypothesis). Then we 
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delete all parameter values e for which fg has an almost periodic point of period n which is not 
sufficiently hyperbolic and bound the measure of that set. 

The main difficulty in the proof is then to find a bound on the measure of "bad" parameter 
values at stage n such that the bounds are summable over n and that the sum approaches 
zero as C — > oo. Let us formalize the problem. Fix positive p, 5, and C, and recall that 
{ln{C,8) = exp(— Cn 1+S )} for n G Z+. Assume p < 1; if not, change its value to 1. 

Definition 5.1. A diffeomorphism f G Di& 1+P (B N ) satisfies the Inductive Hypothesis of order n 
with constants (C, 6, p), denoted f G IH(n, C, 8, p), if for all k <n, every (k, "/^ P (C, 5))-periodic 
point is (fc,7fe(C, <5)) -hyperbolic. 

For / G Diff 1+P (M), consider the sequence of sets in the parameter space HB N (r) 

B n (C, S, p, r, f) = {ee HB N (v) : fg G IH(n - 1, C, 5, p) but fg £ IH(n, C, 5, p)} (22) 

In other words, B n (C, S, p, r, /) is the set of "bad" parameter values e* € HB N (r) for which all 
almost periodic points of fg with period strictly less than n are sufficiently hyperbolic, but there 
is an almost periodic point of period n that is not sufficiently hyperbolic. Let 

M 1= sup maxdlMlci.H/^IIci}; M 1+p = sup max{||/ r || c i +P , M u 2^}. (23) 

e£HB N (r) e£HB N (r) 

Our goal is to find an upper bound (x n (C, 5, p, r, M 1+p ) for the measure p,^ (B n (C, 5, p, r, /)) 
of the set of "bad" parameter values. Then Xm=i Mn(Cj Pi r i -^l+p) is an upper bound on the 
measure of U^ =1 B n (C, S, p, r, /), which is the set of all parameter values e for which fg has for 
some n an (n, 7^ '(C, (5))-periodic point that is not (n, 7 n (C, <5))-hyperbolic. If this sum converges 
and 

00 

5^ At„(C, P, r, M 1+p ) = p(C, 6, p, r, Af 1+p ) -> as C -> 00 (24) 

n=l 



for every positive /?, (5, and Mi +p , then Theorem 3.2 follows. In the remainder of this announce- 
ment we describe the key construction we use to obtain a bound p n {C, 6, p, r, M\ +p ) that meets 
condition (EHI). 



6 Perturbation of recurrent trajectories by Lagrange in- 
terpolation polynomials 

The approach we take to estimate the measure of "bad" parameter values in the space of per- 
turbations HB N (r) is to choose a coordinate system for this space and for a finite subset of 
the coordinates to estimate the amount that we must change a particular coordinate to make a 
"bad" parameter value "good" . Actually we will choose a coordinate system that depends on a 
particular point xq G B , the idea being to use this coordinate system to estimate the measure 
of "bad" parameter values corresponding to initial conditions in some neighborhood of xq, then 
cover B N with a finite number of such neighborhoods and sum the corresponding estimates. For 
a particular set of initial conditions, a diffeomorphism will be "good" if every point in the set is 
either sufficiently nonperiodic or sufficiently hyperbolic. 
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In order to keep the notations and formulas simple as we formalize this approach, we consider 
the case of 1-dimensional maps, but the reader should always have in mind that our approach 
is designed for multidimensional diffeomorphisms. Let /:/—>/ be a C 1 map on the interval 
I = [—1, 1]. Recall that a trajectory {xk}kez of / is called recurrent if it returns arbitrarily close 
to its initial position — that is, for all 7 > we have \xq — x n \ < 7 for some n > 0. A very basic 
question is how much one should perturb / to make xq periodic. Here is an elementary Closing 
Lemma that gives a simple partial answer to this question. 

Closing Lemma. Let {x^ — f k (x )}^, =0 be a trajectory of length n+1 of a map /:/—►/. Let 
u = (xq — x n )/ IIfe=o ( x n-i ~ x k)- Then xq is a periodic point of period n of the map 



fu{x) = f{x) + u Y[ {x - X k ) 

k=0 



(25) 



Of course f u is close to / if and only if u is sufficiently small, meaning that \xq — x n \ should 
be small compared to JIfc=o l^n— 1 ~ x h\- However, this product is likely to contain small factors 
for recurrent trajectories. In general, it is difficult to control the effect of perturbations for 
recurrent trajectories. The simple reason why is because one can not perturb f at two nearby 
points independently . 

The Closing Lemma above also gives an idea of how much we must change the parameter 
u to make a point xq that is (n, 7)-periodic not be (n, 7)-periodic for a given 7 > 0, which as 
we described above is one way to make a map that is "bad" for the initial condition xo become 
"good" . To make use of our other alternative we must determine how much we need to perturb 
a map / to make a given Xq be [n, 7)-hyperbolic for some 7 > 0. 

Perturbation of hyperbolicity. Let {x k = f h ( x o)}kZo ^ e a trajectory of length n of a C 1 map 
/:/—>/. Then for the map 



n-2 



fv(x) = f{x) + v(x - x n -i) Yl( x ~ x kY 



(26) 



k=0 



such that »gI and 



ICC)'0ro)|-l 



n-2 



n-2 



fe=0 



fe=0 



k=0 



> 7 



(27) 



we have that xq is an (n, 7) -hyperbolic point of f v . 

One more time we can see the product of distances Ilfe=o \ x n-i — x k\ along the trajectory is 
important quantitative characteristic of how much freedom we have to perturb. 

The perturbations ( p5| ) and (|2^) are reminiscent of Lagrange interpolation polynomials. Let 
us put these formulas into a general setting using singularity theory. 

Given n > and a C 1 function / : / — > K we define an associated function j 1 '"/ : I n —> 



l 2n by 

i 1,n /(a;o, 



n -l) = (XO, x n-l,f{x ), /(x n _i), f(x ), f'(x n -i)) . 



(28) 



In singularity theory this function is called the n-tuple 1-jet of /. The ordinary 1-jet of /, usually 
denoted by j 1 f(x) — (x, f(x), f'(x)), maps / to the 1-jet space J rl (/,R) ~ L x M 2 . The product 
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of n copies of ^(/jR), called the multijet space, is denoted by 



J^ n {I.R) =J 1 (J,R) x ••■ x J\I,R), (29) 



and is equivalent to I n x R 2n after rearranging coordinates. The n-tuple 1-jet of / associates 
with each n-tuple of points in I n all the information necessary to determine how close the n-tuple 
is to being a periodic orbit, and if so, how close it is to being nonhyperbolic. 
The set 

A n (I) = {{x , . . .,x n -i} x R 2 ™ C J 1,n (I, R) : 3 i ^ j such that x, = xj} (30) 

is called the diagonal (or sometimes the generalized diagonal) in the space of multijets. In 
singularity theory the space of multijets is defi ned outside of the diagonal A n (I) and is usually 



denoted by J^{I, R) = J 1,n {I, R) \ A ra (7) (see | GG |). It is easy to see that a recurrent trajectory 



{xk}kez + is located in a neighborhood of the diagonal A n (I) in the space of multijets for a 
sufficiently large n. If {xk}^Zo 1S a P ar t °f a recurrent trajectory of length n, then the product 
of distances along the trajectory 



n 

fc=0 



x k \ (31) 



measures how close {xkY^Zo 1S ^° * ne diagonal A n (I), or how independently one can perturb 
points of a trajectory. One can also say that ( |3l| ) is a quantitative characteristic of how recurrent 
a trajectory of length n is. Introduction of this product of distances along a trajectory into the 
analysis of recurrent trajectories is a new point of our paper. 



Lagrange interpolation and blow-up along the diagonal 
in multijet space 



Now we present a construction due to Grigoriev and Yakovenko [G_Y] which puts the "Closing 
Lemma" and "Perturbation of Hyperbolicity" statements above into a general framework. It 
is an interpretation of Lagrange interpolation polynomials as an algebraic blow-up along the 
diagonal in the multijet space. In order to keep the notations and formulas simple we continue 
in this section to consider only the 1-dimensional case. 

Consider the 2n-parameter family of perturbations of a C 1 map / : J — > J by polynomials of 
degree 2n — 1 

2n-l 

f e (x) = f{x) + <t> e (x), <t>e{x) = e ^ fc > ( 32 ) 

fe=0 

where e = (eq, . . . , £2n-i) £ R 2n . The perturbation vector e consists of coordinates from the 
Hilbert brick HB 1 {y) of analytic perturbations defined in Section ||. Our goal now is to describe 
how such perturbations affect the n-tuple 1-jet of /, and since the operator j ,n is linear in /, for 
the time being we consider only the perturbations <f> e and their n-tuple 1-jets. For each n-tuple 
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{ x k}k=o there is a natural transformation J l n : I n x E 2n — > J 1 '™ {I, E) from e-coordinates to 
jet-coordinates, given by 

J" 1,n (a;o,...,a; n _i,e) = j 1 '"^^, . . . , z„_i). (33) 

Instead of working directly with the transformation J 1 ' 71 , we introduce intermediate u- 
coordinates based on Lagrange interpolation polynomials. The relation between e-coordinates 
and u-coordinates is given implicitly by 

2n-\ 2n-l fe-1 

= £kxk = uk ~ «))• ( 34 ) 

fc=0 fe=0 j=0 

Based on this identity we will define functions P 1 -™ : 7™ x E 2 " -> 7" x E 2n and tt 1 '™ : I n x E 2 " -> 
J l n (I, E) so that J' 1 '" = 7T 1 '™ o D 1,n , or in other words the diagram in Figure 1 commutes. We 
will show later that D 1 '™ is invertible, while 7T 1 '" is invertible away from the diagonal A„(7) and 
defines a blow-up along it in the space of multijets J rl, ™(/,E). 

PD 1 '^/,!) = J x ■■■ x I xl 2 " 



n times 




Figure 1: an Algebraic Blow-up along the Diagonal A„(7) 

The intermediate space, which we denote by 27D 1,ra (J, E), is called the space of divided dif- 
ferences and consists of n-tuples of points {xk}^Zo an< ^ ^ n real coefficients {u^f,^ 1 - Here are 
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explicit coordinate-by-coordinate formulas defining 7T 1 '™ : W l n (I, E) — ► J 1,n (I, 
(pe(xa) =u , 

4> e {xi) =U + Ui(xi - X ), 

(j> e {x 2 ) = U + Ui(x 2 - Xq) + U 2 (x 2 - X )(x 2 - Xi), 



:(x n -i) = U + Ui(x n -i — Xo) H h U n -i(x n -i - X ) ■ ■ ■ {X n -i - X n -2), 

(35) 



g /2ri-l 

^^ = ^l 51 ^ ~ ^(mod n)) 

fc=0 j=0 



g /2n-l fc-1 
= ^ I X] Ufc II ( X ~~ ^(mod n)) 

y fc=o j=o 

These formulas are very useful for dynamics. For a given base map / and initial point Xq, 
the image f E (x ) = f(x ) + <j) e (x ) of x depends only on u . Furthermore the image can be set 
to any desired point by choosing u appropriately — we say then that it depends nontrivially 
on u . If x , x\, and w are fixed, the image f s {x\) of x\ depends only on u\, and as long as 
Xq ^ x\ it depends nontrivially on u\. More generally for < k < n — 1, if pairwise distinct 
points {xj}^ =0 and coefficients {uj}^Zo are fixed, then the image f e {xk) of Xk depends only and 
nontrivially on Uk- 

Suppose now that an n-tuple of points {xj}™ =0 not on the diagonal A„(J) and Lagrange 
coefficients {uj}"!^ are fixed. Then derivative f' e {xo) at x depends only and nontrivially on u n . 
Likewise for < k < n — 1, if distinct points {xj}™ =0 and Lagrange coefficients {uj\™l$ 1 are 
fixed, then the derivative f' e (xu) at Xk depends only and nontrivially on u n+ k- 

As Figure 2 illustrates, these considerations show that for any map / and any desired trajec- 
tory of distinct points with any given derivatives along it, one can choose Lagrange coefficients 
{itfej^Q 1 and explicitly construct a map f e = f + <t> £ with such a trajectory. Thus we have shown 
that 7T 1 '" is invertible away from the diagonal A„(7) and defines a blow-up along it in the space 
of multijets J 1 < n {I,^L). 

Next we define the function V l n : I n x E 2 " -> W l n (I, E) explicitly using so-called divided 
differences. Let g : E — > E be a C r function of one real variable. 

Definition 7.1. The first order divided difference of g is defined as 

Ag(x ,x 1 ) = 36 

x\ - x 

for X\ 7^ xo and extended by its limit value as g'(xo) for X\ — x$. Iterating this construction we 
define divided differences of the m-th order for 2 < m < r, 

Am i v _ A m ~ 1 g(a:o, . . .,x m - 2 ,x m ) - A^^jxp, . . . , x m _ 2 , z m -i) 

9\ x o, ■ ■ ■ , x m ) — {o I ) 

Xm X m — 1 

for x m -i ^ x m and extended by its limit value for x m -i = x m . 
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U n +k 



Figure 2: Lagrange coefficients and their action 



A function loses at most one derivative of smoothness with each application of A, so A m g is 
at least C r ~ m if g is C r . Notice that A™ 1 is linear as a function of g, and one can show that it 
is a symmetric function of Xo, ■ ■ ■ ,x m ; in fact, by induction it follows that 

A m g{xo,...,x m )=jr (38) 

Another identity that is proved by induction will be more important for us, namely 

A m x k (x , ■ • • , x m ) = p k ,m{xa, x m ), (39) 

where Pk,m(xo, ■ . . , x m ) is for m > k and for m < k is the sum of all degree k — m monomials 
in xo, ■ ■ ■ , x m with unit coefficients, 

m 

Pk,m(X0,---,X m )= W X 7 ■ ( 40 ) 

r H Yr m =k-m j=0 

The divided differences form coefficients for the Lagrange interpolation formula. For all C°° 
functions g : K — > R we have 

g{x) =A°g(x ) + A 1 g(x , Xi){x - x ) + ■ . ■ 

+ A™ _1 .g(x , . . .,x n -i)(x -x )...(x- x n - 2 ) (41) 
+ A n g(x , x n -!,x)(x -x )...(x- ar„_i) 

identically for all values of x, Xo, ■ ■ ■ , a;„_i. All terms of this representation are polynomial in x 
except for the last one which we view as a remainder term. The sum of the polynomial terms is 
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the degree (n — 1) Lagrange interpolation polynomial for g at {xk}^.Zn ■ To obtain a degree 2n — 1 
interpolation polynomial for g and its derivative at {xfc}]?~Q, we s i m ply use (0) with n replaced 
by 2n and the 2n-tuple of points {xfc( moc i n)}^™^ 1. 

Recall that X* 1 '" was defined implicitly by (|34|). We have described how to use divided 
differences to construct a degree In — 1 interpolating polynomial of the form on the right-hand 
side of ( |34| ) for an arbitrary C°° function g. Our interest then is in the case g — (f> e , which 
as a degree In — 1 polynomial itself will have no remainder term and coincide exactly with the 
interpolating polynomial. Thus D > n is given coordinate-by-coordinate by 



/2n-l 

U. 

\ k=0 



= A ™ ( £fcxfc ) ' ■ ' ' X ™ ( mod »)) 

... . ( 42 ) 

— £m + ^ ' £kPk,m{ x 0j • ■ ■ ? (mod n)) 
fc— m+1 

for rn = 0, . . . , 2n — 1. We call the transformation given by ( fi2"| ) the Lagrange map. Notice that 
for fixed {x^}^™^ 1 , the Lagrange map is linear and given by an upper triangular matrix with 
units on the diagonal. Hence it is Lebesgue volume-preserving and invertible, whether or not 
{x^IZq 1 nes on the diagonal A n (J). 
We call the basis of monomials 

k 

Y[{x- ij(mod n)) for k = 0, . . . , 2n - 1 (43) 

in the space of polynomials of degree 2n — 1 the Lagrange basis defined by the n-tuple {xk}kZo- 
The Lagrange map and the Lagrange basis, and their analogues in dimension TV, are useful tools 
for perturbing trajectories and estimating the measure /^^(C, 5, /?, Mi +p ) of "bad" parameter 
values £€ HB N {r). 



8 Discretization method 

The fundamental problem with using the Lagrange basis to estimate the measure of "bad" 
parameter values, those for which there is an almost periodic point of period n that is not 
sufficiently hyperbolic, is that the Lagrange basis depends on the almost periodic n-tuple {xk}^Zo- 
For a particular "bad" parameter value we can fix this n-tuple and the corresponding Lagrange 
basis, then estimate the measure of the set of parameters for which a nearby n-tuple is both almost 
periodic and not sufficiently hyperbolic. But there are a continuum of possible n-tuples, so how 
can we account for all of the possible cells of "bad" parameter values e within our parameter 
brick HB N (r)? At the beginning of Section ^|, we indicated that for a particular initial condition 
xq we would obtain an estimate on the measure of "bad" parameter values corresponding to an 
almost periodic point in a neighborhood of x$, and thus need only to consider a discrete set of 
initial conditions. But as the parameter vector e varies over HB N (r), there is (for large n at 
least) a wide range of possible length-n trajectories starting from a particular xo, so there is no 
hope of using a single Lagrange basis to estimate even the measure of "bad" parameter values 
corresponding to a single Xo- 

The solution to this problem is to discretize the entire space of n-tuples {xfc}J!~Q, considering 
only those that lie on a particular grid. If we choose the grid spacing small enough, then every 



15 



almost periodic orbit of period n that is not sufficiently hyperbolic will have a corresponding 
pseudotrajectory of length n on the grid that also has small hyperbolicity. In this way we reduce 
the problem to bounding the measure of a set of "bad" parameter values corresponding to a 
particular length n pseudotrajectory, and then summing the bounds over all possible length n 
pseudotrajectories on the chosen grid. 

Returning to the general case of C 1+p diffeomorphisms on B N , where we assume < p < 1, 
the grid spacing we use at stage n is 7„(C, 8, p) = N^ 1 (M^^ n {C, S)) 1 ^, where Ah +p > 1 is a 
bound on the C 1+p norm of the diffeomorphisms fg corresponding to parameters e G HB N (r) . 
This ensures that when rounded off to the nearest grid points {^fcjfcZg, an almost periodic orbit 
of length n becomes an N(M\ +p + 1)7„(C, <5, p)-pseudotrajectory, meaning that \fg{xj) — x.j + i\ < 
N(Mi +p + 1)7„(C, 8, p) for j = 0, 1, . . . , n — 2. ft also ensures that when rounding, the derivative 
dfg changes by at most Af 1 1 ^ n 7„(C, 8), which in turn implies that the change in hyperbolicity 
over all n points is small compared with 7„(C, 8). (Recall that j n (C,S) is our tolerance for 
hyperbolicity at stage n.) 

Roughly speaking, in the case N = 1 our estimate on the measure of "bad" parameter values 
for a particular n-tuple {%k}kZo is then proportional to (7„(C, 8, p)) n 7 n (C, 8), whereas the num- 
ber of possible n-tuples is proportional to (7„(C, 8, p)) ™, making our bound p n (C, 8, p, r, M\ +p ) 
on the total measure of "bad" parameter values at stage N proportional to j n (C, 8). The remain- 
ing problem then is to show that for maps satisfying the Inductive Hypothesis of order n — 1, we 
can bound the proportionality factor in such a way that /i n (C, 8, p, r, M\ +p ) meets the conditions 
prescribed in Section ^|, namely that it be summable over n and that the sum approaches as 
C — > ao. (Notice that the sequence 7 n (C, i5) meets these conditions.) The proportionality factor- 
depends on the product of distances described in Section ^, and in \K4\ we proceed as follows. 
At the nth stage we split length n trajectories of diffeomorphisms satisfying the Inductive Hy- 
pothesis into three groups. One group consists of what we call "simple" trajectories for which 
the product of distances is not too small. For nonsimple trajectories we show that either the 
trajectory is sufficiently hyperbolic by the Inductive Hypothesis (second group) or the trajectory 
returns very close to itself before the nth iteration and is simple (not recurrent) up to this point 
(third group). In the latter case, perturbation by Lagrange polynomials of order lower than n at 
the point of a close return can control the behavior of that trajectory up to length n. 

Notice that in the preceding paragraph, even if the product of distances is not small, the 
proportionality factor in our estimate on the measure of "bad" parameter values for a given 
n-tuple {xk}kZa is large because the parameter measure is normalized to be 1 on a brick 
HB l (v) whose sides decay rapidly; the normalization increases the measure by a factor of 
fafi ■ • ■ T n -\Tm-\- However, we are able to show that when considering only diffeomorphisms 
fg with e £ iJB 1 (r), the number of n-tuples we must consider as possible pseudotrajectories of 
fg is reduced by the factor r$r\ ■ ■ ■ r„_2- Due to our definition of an admissible sequence r, the 
remaining factor r„_ir2„-i does not affect the necessary summability properties for the bounds 
p n (C, 8, p, r, M\ +p ). There is an additional distortion of our estimates that is exponential in n, 
due to the fact that an image of a finite-dimensional brick of e-parameters under the Lagrange 
map is a parallelepiped of it-parameters, but no longer a brick. This exponential factor is also 
not problematic, because our bound /i n (C, 8, p, r, Mi +p ) decays superexponentially in n. 
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9 Conclusion 



In this announcement we have only been able to outline some of the fundamental tools that are 
needed for the proof of the main result, which will appear in [K5| and [KH|. Here we list some 
of major difficulties appearing in the proof. 

• We must handle almost periodic trajectories of length n that have a close return after k < n 
iterates, so that as discussed above the product of distances along the trajectory is small. The 
precise definition of a close return is a major problem here. It must not be too restrictive, because 
we must also show that a trajectory without close returns is simple (the product of distances is 
not too small) 

• In dimension N > 1, the Lagrange interpolation polynomials involve products of differences 
of coordinates of points, which may be small even though the points themselves are not close. 
Thus we must be careful about how we construct the Lagrange basis for a given n-tuple of 
points xq, . . . , x n -\ C B N and how to incorporate this into the general framework of the space 
of Lagrange interpolation polynomials. 

• At n-th stage of the induction we need to deal with the (2rt) Ar -dimensional space W<2n-i,;v 
of polynomials of degree 2rt — 1 in N variables and handle the distortion properties of the Lagrange 
map. In such a large dimensional space, even the ratio of volumes of the unit ball and the unit 
cube is of order (2n.) w ( 2n ) [Sarfl . 

In K5], [KH], based on |K4| , we first prove the main 1-dimensional result for the case N = 
p = 1, discussed in Sections 5, 6, and 7 of this announcement, and then using additional tools 
and ideas complete the proof in the general case. 



Appendix: DifT^) and DiflT(M) 

Given a smooth (C°°) compact manifold M of dimension D, for N > 2D the Whitney Embedding 
Theorem says that a generic smooth function from M to is a a diffeomorphism between M and 
its image. To simplify notation, we identify M with its image, so that M becomes a submanifold 
ofJR^. 

Let U C M. N be a closed neighborhood of M, chosen sufficiently small that there is a well- 
defined projection ir : U — > M for which tt(x) is the closest point in M to x. Then for each 
y G M, 7r _1 (y) is an (N — Z?)-dimensional disk. For < p < 1 and y, z £ M choose a linear 
function g Py y, z ■ 7T ^ 1 (y) 7T ~ 1 ( Z ) that maps y to z and contracts distances by a factor of p, and 
such that the dependence of g P , y ,z on y and z is C r . Then we can extend each / 6 DifF(M) to 
a function F € C r (U) that is a diffeomorphism from U to a subset of its interior by letting 

F(x) = g p ^(x).j(*(x)){x) 

where p = ||/" 1 || c r/2. Then by Fenichel's Theorem |J, every sufficiently small perturbation 
F £ G C r (U) of such an F has an invariant manifold M e G U for which tt\m e is a C r diffeomorphism 
from M e to M. Then to such an F £ we can associate a diffeomorphism f s G DifF (M) by letting 

/ 6 (V) =7^(7^6,))). 

Notice that the periodic points of F e all lie on M s and are in one-to-one correspondence with 
the periodic points of f s . Furthermore, because f £ and F £ \m c are conjugate, the hyperbolicity of 

4 This is exactly the place in the proof where we need to impose superexponential decay of our bounds on 
hyperbolicity and periodicity. 
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each periodic orbit is the same for either map. Thus any estimate on P n {F e ) or "f n {F £ ) applies 
also to f e . 
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